CDS
Accession Number | TCMCG064C02945 |
gbkey | CDS |
Protein Id | XP_011096073.1 |
Location | complement(join(439947..439991,440401..440558,441268..441436,442306..442467,442642..442807,443413..443521,443599..443693,443762..443897,444021..444090,444262..444360,444466..444614,444957..445167,445248..445382,445633..445705,446284..446456,447298..447564)) |
Gene | LOC105175349 |
GeneID | 105175349 |
Organism | Sesamum indicum |
Protein
Length | 738 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA268358 |
db_source | XM_011097771.2 |
Definition | DNA mismatch repair protein MLH1 [Sesamum indicum] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGATTTCGATATTGAGAACCCTGATCAAATGGATGTGGAAGAATTAGTTCCCAATCCAATTCACAGGGAACCACCTAGAATCCACCGTCTGGATGAGGCGGTGGTGAACAGAATCGCTGCCGGTGAGGTAATCCAACGGCCGGTCTCTGCCGTAAAAGAGCTCCTCGAGAATAGCATCGACGCTGATTCCAGCTCCATATCCGTCTTAGTTAAGGACGGCGGCCTCAAACTCATCCAAGTTTCGGATGATGGCCACGGAATTCGATATGAAGATCTGTCAATATTATGCGAAAGGCATACAACCTCTAAACTGAGTAAATTTGAGGACTTACTGTCTATCAAATCAATGGGGTTTAGGGGTGAGGCTTTGGCAAGTATGACCTATGTTGGTCATGTGACAGTCACCACAATTACCAAGGGTCAGTTGCATGGATACAGGGCGACTTATAAAGATGGTGTAATGGAGCATGAACCAAAAGCTTGTGCAGCTGTTAAAGGCACTCAAATCATGATCGAGAATTTATTTTATAACATGAGCGCTCGGAGGAAAACACTTCAGAATTCTGCGGATGACTATCCCAAAATAGTCGATTTAATTTGTCGGTTTGCCATCCATCACATAAGTGTGAACTTCTCTTGCAGAAAGCATGGAGCTGCTAGAGCAGATGTTCACTCAGTTGCTACCACCTCAAGGCTTGACACAATAAGATCTGTGTATGGGGTATCAGTTGCACAAAATCTTATGGAGATAGAAGTTTCAGAAGATGATCCTTCAAGTTCGATTTTTGAAATGGATGGCTTTATCTCAAATTCAAATTACATTGCAAAGAAGATAACAATGGTCCTTTTCATTAATGACAGGCTGGTGGAGTGCGGTGCTCTAAAGAGGGCAATTGAAATTGTTTATGCTGCAACTTTGCCGAAAGCATCAAAACCTTTCATCTACATGTCAATCAAATTGCCACCGGAGCACATTGATGTAAATGTACACCCGACAAAGAGAGAGGTGAGCCTCCTGAATCAAGAAGTTATAGTTGAGAAGATTCAATCTGCCATAGAGTCAAAATTGAGGAACTCCAATGAGTCTCGGACATTTCAGGAACAGAGGGTGGATCCTTCTCCATCTGTTTCTATTTCTATGAGCAAAGGATCCTCCAGTCATTCCTCATCCTCTGGATCAAAATCGCAAAAAGTTCCAGTGCAGAAAATGGTACGGACAGATTCACAGGATCCTGCAGGAAGGTTGCATGGATACTTGCAAGTTAAGCCATCTAGTCAACTACAAGGAAGTTCTCGCCTGGCGTCCATAAGGTCTGCAATCAGACAAAGGCGGAACCCAAGGGAAACTGCAGACCTTACTAGCATTCAGGAACTTATCAGAGAGATTGATTCTAGCTGTCACTCTGAGTTGCTGGACATTGTTAGGAACTGCTCATATATTGGGATGGCAGATGATGTTTTTGCCCTGCTGCAGCACAACACCCACCTTTACCTTGCTAATGTGGTCAATTTGAGCAAAGAGCTGATGTACCAGCAAGTCTTACGGCGGTTTGCGCATTTTAGTGCAATTCAATTGAGTGATCCAGCTCCATTGCCAGAATTAATAATGCTGGCTTTGAAAGAGGAGGAATTGAATACAGAAGGCGATGAAAACAATGACCTGAAAGAAAAGATTGCAGAAATGAATACAGAAATGATCAAGCAGAAAGCAGAGATGCTGGAGGAGTACTTTGGAATTCATATCGATCCAAATGGTAATTTGTCCAGGCTACCCATCGTACTCGACCAATACACACCCGATATGGATCGCGTTCCAGAGTTTGTTCTCTGCTTGGGCAATGATGTTAATTGGGATGATGAGAAAATTTGTTTTCAAACTATTGCTGCTGCCATCGGGAATTTTTATGCTTTACATCCACCGCTGTTGCCTAATCCATCTGGCGATGGCATGCAATTTTATCAGAGAGTGCCTTCTCGTACTCCTGAAGAAGGA
GATGCCTCAAAAAGTGCTGACGATGTAAACAAGGATGAAGTTGAGCATGAGCTACTTTTGGAGGCTGAAAATGCTTGGGCTCAGCGTGAATGGTCAATACAGCATGTGTTGTTCCCCTCCATGCGACTTTTCCTCAAGCCTCCAACTTCAATGGCTGCAAATGGAACATTTGTCAAGGTTGCATCGTTGGAGAAACTCTATAAAATCTTTGAGAGATGCTAA |
Protein: MDFDIENPDQMDVEELVPNPIHREPPRIHRLDEAVVNRIAAGEVIQRPVSAVKELLENSIDADSSSISVLVKDGGLKLIQVSDDGHGIRYEDLSILCERHTTSKLSKFEDLLSIKSMGFRGEALASMTYVGHVTVTTITKGQLHGYRATYKDGVMEHEPKACAAVKGTQIMIENLFYNMSARRKTLQNSADDYPKIVDLICRFAIHHISVNFSCRKHGAARADVHSVATTSRLDTIRSVYGVSVAQNLMEIEVSEDDPSSSIFEMDGFISNSNYIAKKITMVLFINDRLVECGALKRAIEIVYAATLPKASKPFIYMSIKLPPEHIDVNVHPTKREVSLLNQEVIVEKIQSAIESKLRNSNESRTFQEQRVDPSPSVSISMSKGSSSHSSSSGSKSQKVPVQKMVRTDSQDPAGRLHGYLQVKPSSQLQGSSRLASIRSAIRQRRNPRETADLTSIQELIREIDSSCHSELLDIVRNCSYIGMADDVFALLQHNTHLYLANVVNLSKELMYQQVLRRFAHFSAIQLSDPAPLPELIMLALKEEELNTEGDENNDLKEKIAEMNTEMIKQKAEMLEEYFGIHIDPNGNLSRLPIVLDQYTPDMDRVPEFVLCLGNDVNWDDEKICFQTIAAAIGNFYALHPPLLPNPSGDGMQFYQRVPSRTPEEGDASKSADDVNKDEVEHELLLEAENAWAQREWSIQHVLFPSMRLFLKPPTSMAANGTFVKVASLEKLYKIFERC |
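The Location field above uses the INSDC feature-location syntax: `join(...)` lists the exon spans in 1-based inclusive coordinates, and `complement(...)` indicates that the feature lies on the reverse strand. As a minimal sketch of how the record's fields relate to each other, the Python snippet below sums the exon spans and checks that the total agrees with the stated protein length of 738 aa plus one stop codon. The helper names `parse_segments` and `cds_length` are hypothetical and are not part of any library or of the source database.

```python
import re

# Location string copied verbatim from the record's Location field.
LOCATION = (
    "complement(join(439947..439991,440401..440558,441268..441436,"
    "442306..442467,442642..442807,443413..443521,443599..443693,"
    "443762..443897,444021..444090,444262..444360,444466..444614,"
    "444957..445167,445248..445382,445633..445705,446284..446456,"
    "447298..447564))"
)


def parse_segments(location):
    """Return (start, end) integer pairs for every 'start..end' span.

    Hypothetical helper: it only handles simple spans and ignores
    partial-location markers such as '<' or '>'.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location):
    """Sum the span lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in parse_segments(location))


segments = parse_segments(LOCATION)
total_nt = cds_length(LOCATION)
on_reverse_strand = LOCATION.startswith("complement(")

# The CDS includes the stop codon, so the protein has one residue fewer
# than the number of codons.
protein_length = total_nt // 3 - 1

print(len(segments), total_nt, protein_length, on_reverse_strand)
# prints: 16 2217 738 True
```

The 16 spans total 2217 nt, i.e. 739 codons, which matches the 738 aa protein followed by the terminal TAA stop codon in the CDS sequence.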